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ABSTRACT 

H1 and related linker histones are important both for 
maintenance of higher-order chromatin structure 
and for the regulation of gene expression. The 
biology of the linker histones is complex, as they 
are evolutionarily variable, exist in multiple isoforms 
and undergo a large variety of posttranslational 
modifications in their long, unstructured, NH 2 - and 
COOH-terminal tails. We review recent progress in 
understanding the structure, genetics and post- 
translational modifications of linker histones, with 
an emphasis on the dynamic interactions of these 
proteins with DNA and transcriptional regulators. 
We also discuss various experimental challenges to 
the study of H1 and related proteins, including limi- 
tations of immunological reagents and practical 
difficulties in the analysis of posttranslational modi- 
fications by mass spectrometry. 

CHROMATOSOME STRUCTURE 

Histones are evolutionarily conserved proteins responsible 
for condensation, organization and regulation of the 
DNA within the nucleus of all eukaryotes. The basic struc- 
tural element of DNA compaction, the nucleosome core 
particle, is made up of superhelical DNA wrapped about a 
protein octamer composed of two copies of each core 
histone H2A, H2B, H3 and H4 (1^4). Structurally, each 
core histone has a long central helix with a helix-strand- 
helix motif on each end forming what is termed the 
histone fold (5). Hydrophobic interactions between two 
core histone monomers form heterodimers in a head- 
to-tail configuration called the handshake motif (2-7). 
The heterodimers of histones H3 and H4 further associate 



to form tetramers (5,6). The histone octamer is assembled 
from two H2A-H2B dimers binding opposite the H3-H4 
tetramer (7). Micrococcal nuclease digestion of chromatin 
exposed to increasing salt concentrations shows symmet- 
rical association of ~ 146 base pairs of left-handed super- 
helical DNA wrapped ~1 .65 turns around the histone 
octamer forming the nucleosome core particle (5,8-12). 
Crystallography orients the histone octamer with the 
H3-H4 tetramer centered between and in direct contact 
with the DNA entry and exit points and the H2A-H2B 
tetramer centered opposite. Higher-order chromatin struc- 
tures are produced through the binding of a linker histone, 
histone HI, to the nucleosome core particle to form the 
chromatosome (13-16). 

Nucleosomal stabilization facilitated by the 
chromatosome is provided through the binding of 
histone HI to the nucleosomal dyad and the linker 
DNA entering and exiting the core particle (16-26). 
Recent »OH radical footprinting experiments show that 
the positioning of histone HI at the nucleosomal dyad axis 
protects an additional 20 base pairs of DNA, 10 base 
pairs from both the entering and exiting linker DNA, 
from micrococcal nuclease digestion (8,10,17,25,26). 
Additional experimental evidence illustrates the influence 
of histone HI on chromatin arrangement and compaction 
(14,19,27-33). However, the specific folding of the 30-nm 
filament remains controversial and potentially variable in 
nature (32). In any case, recent studies suggest histone HI 
binding provides stabilization and protection through the 
formation of a dynamic and polymorphic linker histone/ 
linker DNA stem structure (25,26,30,32). Stem-to-stem 
interactions of neighboring nucleosomes are hypothesized 
to stabilize folding into higher-order chromatin fibers (26). 
No matter how the 30-nm chromatin fiber ultimately 
folds, the influence of histone HI is dependent on its 
unique structural characteristics. 



*To whom correspondence should be addressed. Tel: +1 614 292 6215; Fax: +1 614 292 4118; Email: parthun.l@osu.edu 

Correspondence may also be addressed to Michael A. Freitas. Tel: +1 614 688 8432; Fax: +1 614 688 8675; Email: michael.freitas@osumc.edu 

© The Author(s) 2013. Published by Oxford University Press. 

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.Org/licenses/by/3.0/), which 
permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. 



9594 Nucleic Acids Research, 2013, Vol. 41, No. 21 



HISTONE H1 STRUCTURE 

Histone HI has a tripartite structure containing an evolu- 
tionarily conserved central globular domain with flanking 
variable domains. X-ray crystallography of the globular 
domain of the avian erythrocyte linker histone H5 (con- 
sidered a member of the HI family) shows a winged-helix 
motif consisting of three alpha helices with a C-terminal 
beta hairpin (34). An antiparallel beta sheet is formed 
between the C-terminal beta hairpin and a short beta 
strand connecting the first and second alpha helices (34). 
Conformational studies on the globular domain of the 
erythrocyte linker histone show that H5 binds asymmetric- 
ally to two DNA duplexes through two clusters of highly 
conserved, positively charged residues on opposite sides of 
the globular H5 molecule (18,34). Initial positional studies 
of linker histone H5 on chicken nucleosomes illustrate the 
globular domain is located between chromatosomal 
terminal DNA and DNA near the dyad axis of the nucleo- 
some (20). However, more recent experiments using the 
globular domain of histone HI. 5 show binding at the 
DNA minor groove of the nucleosomal dyad axis (25). 
As a result, the globular domain has been shown to 
mediate the protection of 20 additional base pairs of 
linker DNA by the chromatosome (17,25,26). Although 
binding of the globular domain of histone H 1 can protect 
almost two full turns of superhelical DNA from micrococ- 
cal nuclease digestion, it is the flanking terminal regions of 
the linker histone that allow for the formation of higher- 
order chromatin structures (17). 

The amino terminus of histone HI is considered nomin- 
ally unstructured, as solution and X-ray crystallographic 
stuctures have yet to be determined (15). Based on 
sequence, the N -terminus can be divided into two sub- 
regions (35). The extreme N-terminal sequence is enriched 
in hydrophobic residues, whereas a highly basic portion 
resides close to the globular domain (35). The basic cluster 
has been linked to globular domain positioning and takes on 
an alpha helical structure in the presence of DNA, whereas 
the hydrophobic region remains uncharacterized (36,37). 
Although the N-temiinus of bovine thymus histone HI 
has been shown to be non-essential for the formation of 
higher-order chromatin structures, deletion of the 
N-terminal domain (NTD) of histone HI isoforms reduces 
the binding affinity for chromatin in vitro (36,38,39). 
Additionally, histone HI NTD swapping experiments 
between mouse Hl° and Hlc show exchange of their chro- 
matin binding affinities via fluorescence recovery after 
photobleaching analysis (40). These studies suggest the 
NTD of histone HI plays a role in proper binding to the 
nucleosome. However, additional studies are needed to char- 
acterize the functionality of the NTD of histone HI. 

Similar to the amino terminus of histone HI, the 
carboxy terminus lacks X-ray crystallographic resolution 
and is assumed to nominally be a random-coil, or intrin- 
sically disordered, in solution (41^5). In vitro data 
suggest that on DNA binding at physiological salt concen- 
trations, the C-terminal domain (CTD) of histone HI 
takes on a folded conformation dominated by common 
secondary structural components such as alpha helices, 
beta sheets, loops and turns (42,45-49). Recent data 



presented by Fang et al. support the formation of second- 
ary structure, as the carboxy terminus of histone HI 
settles into DNA helices, allowing for the formation of 
the nucleosome stem structure (49). Additionally, the 
interaction of the CTD with linker DNA has been 
shown to extend beyond the initial 20 base pairs 
entering and exiting the nucleosome (45). 

The CTD accounts for more than half the linker histone 
sequence, with ~40% composed of lysine, 20-35% alanine 
and 15% proline residues (43). Mutational studies on the 
CTD of histone HI suggest two distinct functional regions 
for DNA binding, two 24-amino-acid lengths, facilitate 
chromatin condensation (44,50). It is hypothesized the re- 
maining CTD length (~50 amino acids) is involved in 
protein-protein interactions (44). Support for this 
concept was recently shown through the binding of 
DNA methyltransferases (DNMT1 and DNMT3B) to 
the CTD of mouse histone HI by Yang et al. (51). 

The net positive charge imparted on the CTD from the 
high lysine content allows for regulation of higher-order 
chromatin structures through DNA backbone charge 
neutralization (52,53). This allows for low-affinity HI 
binding to give rise to the formation of secondary struc- 
ture in the CTD that permits high-affinity binding 
(17,36,38,49,50,52,54,55,). In addition to the globular 
domain, the secondary structure in the CTD enables the 
formation and stabilization of linker DNA into higher- 
order chromatin structures (17,25,26,36,38,54,55). The 
length, charge and number of posttranslational modifica- 
tion (PTM) sites of the C-terminal tails vary between 
histone HI isoforms, suggesting that individual HI 
variants may play distinct roles in the regulation of 
higher-order chromatin structure. 



HISTONE H1 GENE FAMILY 

The histone HI gene family in lower organisms is less 
evolutionarily conserved than that of the core histones. 
For example, in Saccharomyces cerevisiae, the sequence 
homology between Hhol, the S. cerevisiae histone HI 
homolog, and Homo sapien HI is 31% identical and 
44% similar, whereas histone H4 between the species is 
92% identical and 96% similar. Conversely, in higher- 
order organisms such as the Gallus gallus (chicken), the 
erythrocyte linker histone, H5, shows high sequence 
homology (66%) to the human histone H1.0, with the 
greatest sequence divergence found in the CTD (56). In 
addition to sequence variation, histone HI proteins also 
display a range of structures. For instance, 5". cerevisiae 
Hholp contains two globular domains, whereas 
Tetrahymena completely lacks a globular domain 
(57,58). Eukaryotes also differ in the number of histone 
HI variants present. H. sapiens and Mus musculus both 
have 11 distinct variants, whereas Caenorhabditis elegans 
has eight and Xenopus laevis has five (59). The H. sapien 
family of histone HI proteins contains five somatic 
variants (Hl.l, H1.2, HI. 3, H1.4 and HI. 5), which are 
expressed in nearly all cells (60-62). Six additional HI 
variants have been identified in specific tissues, such as 
Hit and H1T2 in the testis, or cell types, such as H1.0 
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in terminally differentiated cells (56,63-71). Two types of 
histone HI genes exist in human cells: replication-inde- 
pendent and replication-dependent genes. The replica- 
tion-independent HI genes, such as histone H1(0), 
exhibit a replacement phenotype (72). These replacement 
histone His are genomically isolated from other histone 
genes with transcription based on cellular status (72). 
Whereas replication-independent His have been 
observed throughout the cell cycle, the majority of the 
histone HI protein is produced during S phase of the 
cell cycle (73-77). 

In contrast to replication-independent HI expression, 
the replication-dependent variants are found in a large 
cluster alongside many of the core histone genes located 
on the short area of chromosome 6 (6p21-p22) (62,78,79). 
These histone HI genes, located in gene cluster HIST1, 
have paired expression with DNA replication and core 
histone mRNA expression levels, although specific HI 
variants have been shown to have fluctuating expression 
across S phase of the cell cycle (77,80-84). The mRNA of 
replication-dependent HI genes lack a poly(A) tail and 
introns commonly observed in other protein-coding 
genes (85,86). Alternatively, somatic HI genes contain a 
3' stem-loop sequence allowing for rapid translation 
during DNA replication, while permitting tight regulation 
of gene expression after the conclusion of S phase 
(83,86,87). The expression patterns of individual HI 
variants are essential to the functional properties of HI 
in the chromatin regulatory system. 

In addition to the expression of the normal somatic 
histone variants, several histone HI sequence variations 
have been described. Initially, two sequence variants 
were described in K562 and Raji cells (88). In the K562 
cell line, an alanine to valine substitution is observed at 
position 17 of histone H1.2 (H1.2A17V) (88). A histone 
HI. 4 sequence variant was found in the Raji cell line cor- 
responding to a lysine to arginine substitution at position 
173 (H1.4K173R) (88). Finally, an alanine to threonine 
substitution at position 142 on histone HI. 2 was described 
by mass spectrometry (MS) in HeLa S3 cells (89). 
Although identified, the function of the sequence vari- 
ations remains unknown. 

Overexpression of histone HI variants shows functional 
differences between the isoforms. Experiments over- 
expressing histone Hlc and H1(0) in the mouse 3T3 
cell lines led to distinct phenotypes in these cells. 
Overexpression of H1(0) results in an increase in nucleo- 
somal repeat length and a decline in cell cycle progression 
(90,91). Conversely, overexpression of murine histone Hlc 
gives rise to an increase in or no change in transcription 
levels, while conferring no effect on cell cycle progression 
(91). These overexpression experiments show functional 
differences between the two variants, although additional 
experiments with other isoforms are still needed to further 
elucidate HI function. 

HISTONE H1 DYNAMICS 

Histone HI binding to chromatin has been shown to be 
dynamic in nature, with specific HI variants divergent in 



their binding affinity for chromatin (54,55,92-94). It is 
thought that a high percentage of the total nuclear HI is 
bound to nucleosomes at any given time; however, these 
interactions are individually transient (54,55). Data pre- 
sented by Lever et al. demonstrate in vivo dynamics of 
histone Hl.l occur through soluble intermediates, giving 
rise to a rapid "stop-and-go" movement of Hl.l in the 
nucleus between random binding sites (54). Others have 
further demonstrated that the transient binding of HI 
variants with nucleosomes is affected by the structure of 
the HI variant, PTMs present on HI and competition for 
chromatin binding by other nuclear factors. 

Histone HI, as described earlier, has a tripartite struc- 
ture. Of these, the CTD is the primary determinant of the 
binding dynamics of each specific variant. For example, 
fluorescence recovery after photobleaching experiments 
using NTD green fluorescent protein-tagged HI variants 
(H1.0-H1.5) show that the variants with the shortest 
CTDs have the shortest residence times on nucleosomes 
(93). Additionally, Th'ng et al. show through CTD 
swapping between Hl.l and HI. 4 or HI. 5 and truncation 
experiments with HI. 5 that the CTD determines the 
in vitro binding affinity for the nucleosome (93). 
Similarly, others have shown truncation of histone 
Hl.l's CTD reduces the residence time of the variant on 
the nucleosome ~10-fold in vivo (38,54). While CTD 
length clearly affects variant nucleosomal residence 
times, the number of phosphorylations and phosphoryl- 
ation sites also play a role. 

Phosphorylation of histone HI has many distinct func- 
tions, leading to both chromatin condensation and 
decondensation dependent on the site of phosphorylation 
and cell cycle context. Histone HI phosphorylation has 
been shown to progressively increase as a cell progresses 
from Gi to mitosis during the cell cycle (95-101). The 
overall importance of histone HI phosphorylation was 
highlighted by several studies showing that changes in 
histone HI phosphorylation can prevent entry into 
mitosis, thus linking histone HI phosphorylation with 
the cell cycle (102,103). The phosphorylation of histone 
HI during the cell cycle has been theorized to be a two- 
fold process (92). First, an interphase (G 0 -S phase) partial 
phosphorylation that allows for chromatin relaxation and 
facilitates transcriptional activation (104-107). Second, a 
maximal phosphorylation during mitosis (M phase) allows 
for chromatin condensation and separation of chromo- 
somes into daughter cells (95-101). The partial phosphor- 
ylation observed in interphase has been shown to induce 
structural changes in the CTD of H 1 , which in turn leads 
to a decreased affinity of histone HI for DNA (108). 
Mutational studies mimicking histone HI phosphoryl- 
ation have been shown to change the chromatin histone 
HI dynamics (109). Additionally, decondensation of chro- 
matin at DNA replication forks has been shown to be a 
result of histone HI phosphorylation by cyclin-dependent 
kinase (CDK) 2 (110). To this end, work by Talasz et al. 
and Sarg et al. with histone HI. 5 suggests interphase phos- 
phorylation only occurs on serine residues at SPK(A)K 
sequences (H1.5S17p, H1.5S172p and H1.5S188p) 
(111,112). Zheng et al. have supported this argument by 
demonstrating HI. 2 and HI. 4 have serine-only 
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phosphorylation during interphase by MS (H1.2S173p, 
H1.4S172p and H1.4S187p) (89). The identified interphase 
phosphorylation sites remained in the mitotic fraction, 
suggesting preferential hierarchy of phosphorylation on 
these HI variants (89). Collectively these studies support 
the model that interphase phosphorylation on specific 
histone HI variants can disrupt DNA-histone inter- 
actions, allowing for chromatin relaxation through 
histone HI mobilization, and allow for competition and 
regulation of binding sites on DNA by other nuclear 
proteins (110,113). 

The dynamic nature of histone HI during interphase 
allows for regulation of DNA access through several 
mechanisms (114). First, through condensation of chro- 
matin, histone HI can limit access of other proteins to 
chromatin. Lee et al. have shown that phosphorylation 
of histone HI, mimicking HI removal from chromatin 
and decondensation, allows for glucocorticoid-induced 
transcription of the mouse mammary tumor virus pro- 
moter (115). Additionally, phosphorylation of histone 
HI has been shown to disrupt the interaction between 
itself and heterochromatin protein la, leading to chroma- 
tin decondensation (116). Second, histone HI -bound nu- 
cleosomes can limit access of chromatin remodeling 
complexes. For example, the activity of ATP-dependent 
SWI/SNF chromatin remodeling complexes are reduced 
and altered when nucleosomes are bound to HI 
(117,118). Additionally, this reduction in SWI/SNF 
activity can be rescued by phosphorylation of histone 
HI, suggesting a role for histone HI phosphorylation in 
chromatin remodeling (119). However, data by Maier 
et al. and Clausell et al. have shown that chromatin re- 
modeling complexes can remain active even in the 
presence of linker histone (24,120). These data suggest 
specific remodeling complexes can access key nucleosomal 
elements without the removal of the linker histones. Next, 
stabilization of the nucleosomal positioning by histone HI 
limits the rotational access of specific DNA sequences to 
transcription factors and other nuclear proteins. This prin- 
ciple was demonstrated by Cheung et al., who showed that 
estrogen receptor a-mediated transcriptional activity is re- 
pressed by HI via decreased promoter accessibility (121). 
However, others have demonstrated transcriptional acti- 
vation of the mouse mammary tumor virus promoter 
after histone HI phosphorylation, suggesting a rescue of 
transcription can be achieved by histone HI phosphoryl- 
ation (115,122,123). Finally, histone HI binding sterically 
inhibits access of other factors to the chromatin. Herrera 
et al. have demonstrated histone HI sterically occludes 
histone acetyl transferase complexes from acetylating 
the N-terminal tail of histone H3 (124). Whereas inter- 
phase phosphorylation of histone HI is largely involved 
in transcriptional regulation, mitotic phosphorylation 
yields a condensed chromatin state allowing for cell 
division. 

The second phase of histone HI phosphorylation occurs 
during mitosis. Similar to interphase phosphorylation, 
mitotic phosphorylation has been shown to be primarily 
a result of CDK activity at sites of S/TPXK consensus 
sequences, although non-CDK mitotic phosphorylations 
have also been identified (Table 1). First described in the 



1970s, mitotic phosphorylation of histone HI is a maximal 
phosphorylation resulting in the condensation of chroma- 
tin (95-101). Several studies by Deterding et al. using MS 
have shown reduction in variant- specific histone HI phos- 
phorylation in response to therapeutics (CDK inhibitors) 
and hormones (dexamethasone) (145-147). Furthermore, 
Th'ng et al. have shown through the use of the kinase 
inhibitor staurosporine that the hyperphosphorylation 
of histone HI observed on mitotic chromatin is 
required to retain condensed chromatin structures (102). 
Additionally, they established that the inhibition of the HI 
kinase by staurosporine arrests cells at the G 2 /M transi- 
tion, preventing progression into mitosis (102). This study 
and others, such as those seen with the topoisomerase in- 
hibitor VM-26, emphasize the importance of histone HI 
phosphorylation in cell cycle progression (103,148,149). 
Collectively, these studies suggest the potential for 
histone HI kinase inhibitors as cancer therapeutics. 

An important aspect of histone HI dynamics that 
remains unresolved is the degree to which histone chaper- 
ones control the dynamics and assembly of the linker 
histones. Due to their exceptionally high degree of 
positive charge, the histone proteins can form indiscrim- 
inate and deleterious complexes with negatively charged 
species in the cell such as nucleic acids. A key function of 
the class of proteins known as histone chaperones is 
thought to prevent these inappropriate interactions. 
Although a large number of chaperones have been 
demonstrated to play a role in core histone transit and 
assembly, it is not clear whether the movements of 
histone HI in the cell and its association with chromatin 
are mediated by other protein factors or whether they 
occur spontaneously (150). One potential histone HI 
chaperone is the human protein nuclear autoantigenic 
sperm protein (NASP). NASP has been shown to be 
associated with both linker and core histones in the cell 
(151,152). In vitro, NASP is capable of binding to histone 
HI withnM affinity and to transfer HI molecules to DNA 
(153-155). However, a role for NASP in the cellular 
dynamics of histone HI has not been directly 
demonstrated. 



HISTONE H1 POSTTRANSLATIONAL 
MODIFICATIONS 

Although both interphase and mitotic phosphorylation- 
specific sites have been observed by MS, only a small 
number of sites have been functionally examined. For 
example, phosphorylation of HI. 4 Ser27 (H1.4S27p) by 
Aurora B kinase blocks the binding of heterochromatin 
protein lot to methylated Lys26 (H1.4K26me), suggesting 
a cross-talk between these modifications (156,157). Zheng 
et al. showed that interphase phosphorylation at Serl73 
on H1.2 (H1.2S173p) and Serl87 on H1.4 (H1.4S187p) is 
localized to the nucleoli of HeLa S3 cells (89). 
Phosphorylated Serl87 (H1.4S187p) was further shown 
to localize to active rDNA promoters, and phosphoryl- 
ation at this site can be induced by dexamethasone treat- 
ment (89). Ser35 phosphorylation on histone HI. 4 
(H1.4S35p) by protein kinase A mediates HI. 4 removal 
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from the mitotic chromatin, suggesting a mechanism of 
histone HI mitotic dynamics (158). However, these few 
examples are not the only sites characterized, and many 
sites of histone H 1 phosphorylations have yet to be func- 
tionally described in a site-specific manner. 

Although histone HI phosphorylation is the most re- 
searched PTM, other PTMs such as acetylation, methyla- 
tion and ubiquitination have also been identified (Table 1). 
The functional relevance of non-phosphorylation PTMs 
on histone HI is just coming to light. For example, 
lysine acetylation at position 34 on histone HI. 4 
(H1.4K34ac) by the histone acetyltransferase GCN5 has 
been linked to transcriptional activation and increased 
dynamic mobility in vitro (159). Additionally, further 
evidence for histone modification cross-talk was shown 
by the ARTD1 -mediated PARylation of histone H3, 
which induces a shift in specificity of the methyltransferase 
SET7/9 from H3 to histone HI (160). Kassner et al. 
further identified new sites of histone HI. 4 methylation 
at Lys-Ala-Lys motifs not previously described (160). 
The role of other non-phosphorylation PTMs on histone 
HI function and dynamic mobility is yet to be explored. 



EXTRACHROMATIN H1 FUNCTION 

Beyond the function of histone H 1 on chromatin conden- 
sation, histone HI (specifically HI. 2) has been found to 
have an extrachromatin function. Konishi et al. found 
translocation of histone HI. 2 to the cytoplasm in 
response to X-ray-induced DNA double-strand breaks 
(161). Furthermore, Gine et al. show cytosolic 
movement of HI. 2 in chronic lymphocytic leukemia cells 
after therapeutic intervention (162). The cytosolic histone 
HI. 2 was shown to induce apoptosis through a Bak- 
mediated mitochondrial release of cytochrome C, 
allowing for caspase activation (161,162). However, cyto- 
solic histone HI. 2 has been observed in non-apoptotic 
cells as well (14,63) (unpublished data). Collectively, 
these results suggest there is a mechanism of regulation 
for histone HI. 2 apoptotic induction beyond localization 
of HI. 2 in the cytoplasm. Data presented by Green et al. 
and our own unpublished data suggest HI isoforms are 
phosphorylated in the cytoplasm of non-apoptotic cells, 
giving an underlying potential for apoptosis regulation or 
chaperone binding (163). 



CHALLENGES OF HISTONE H1 ANALYSIS 

Although the work described previously illustrates the 
successes of research focused on histone HI, progress has 
been limited for several reasons. Availability and specificity 
of immunological reagents for histone HI are drastically 
lacking. As methods using antibodies are the primary 
means of molecular and biochemical investigation, limita- 
tions in quality antibodies have caused severe difficulty in 
study. As a result, a lag in understanding of the biological 
function of histone H 1 and its PTMs persists. The produc- 
tion of immunological reagents is hindered by several 
factors. The demand for histone HI antibodies remains 
generally low. A recent search of PubMed for primary 
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Figure 1. A search of PubMed for primary research articles containing histone HI or histone H3 in the title or abstract. Data show a steady-state 
low number (< 100) of publications for histone HI, whereas histone H3 displays an increasing trend over time. 



research articles with histone HI in the title or abstract 
across the past decade revealed a steady-state number 
(<100/year) of publications (Figure 1). A similar search 
for the core histone H3 showed an increase in publications 
over the past decade (Figure 1). These data suggest histone 
HI research has not seen the same explosive growth evident 
for the chromatin field. 

We suspect the lull in histone HI publications can be 
attributed to two interdependent factors. First, as discussed 
earlier, histone HI displays much lower level of evolution- 
ary conservation than the core histones, suggesting that 
linker histones are not as fundamentally important to chro- 
matin biology. Interest in histone HI and histone HI 
reagents was also dampened by initial studies that sug- 
gested that the linker histones were not essential for cell 
viability. For example, loss of histone HI genes in 
M. musculus, T. thermophyla and 5. cerevisiae did not 
affect viability (164-166). However, the essential role of 
histone HI in mammals was subsequently affirmed in a 
landmark publication from the Skoultchi laboratory. 
Whereas initial single knockout experiments of Hlc, Hid 
and Hie showed no apparent phenotypical changes, a triple 
knockout of Hlc, Hid and Hie (H1.2, HI. 3 and H1.4 
mouse orthologs) resulted in a 50% reduction in the HI/ 
nucleosome ratio and nucleosome repeat length, ultimately 
leading to embryonic lethality by El 1.5 (167,168). These 
data suggest a key structural role of histone HI is to keep 
the nucleosomes adequately spaced (168,169). Additional 
in vitro work by Fan et al. showed triple-Hl knockout in 
embryonic stem cells leads to substantial changes in chro- 
matin structure with variations in gene expression localized 
to sites of genes regulated by DNA methylation (27). 
Findings by Yang et al. have confirmed this work by 
showing histone HI can interact with DNA-modifying 
enzymes such as DNMT1 and DNMT3B to alter gene tran- 
scription (51). Furthermore, Lee et al. have shown a 
complex of histone Hlb (HI. 5 mouse ortholog) and the 
MSX1 transcription factor represses mesoderm differenti- 
ation through the regulation of the MyoD gene, suggesting 



a role for specific HI isoforms in the development of muscle 
(170). 

Second, the high sequence homology between variants 
of histone H 1 hinders the ability to produce high-specifi- 
city antibodies for individual variants. For example, 
CLUSTAL 2.1 alignment of the four most common 
somatic histone HI variant sequences shows high amino 
acid sequence conservation (Figure 2A) (171). Pairwise 
scoring of the sequence alignments between variants 
shows 74-87% sequence homology (Figure 2B). Domain 
analysis and sequence alignments show the divergence in 
the sequences of the HI variants is primarily located at 
the amino and carboxy termini of the HI molecule 
(Figure 2A). As a result, distinction between variants of 
histone HI would require partial identification of epitopes 
from one of these two domains. However, the high 
number of PTMs on the terminal tails of histone HI 
adds additional complexity that could alter the affinity 
of antibodies for a significant fraction of the molecules 
in a cell. Despite these complications, there have been a 
number of recent successes (89,112,143,157,172-174). 
Importantly, the use of peptides based on the divergent 
sequences in the NH2-terminal tails of the HI variants has 
led to the production of variant-specific antibodies for 
both chicken and mammalian HI (173,174). In addition, 
the generation of phosphorylation-specific HI antibodies 
has begun to shed light on the signal transduction 
pathways that are involved in the modification of the 
linker histones. For example, Hergeth et al. generated a 
polyclonal antibody against phospho Ser27 of HI. 4 
(H1.4S27p) and demonstrated this phosphorylation was 
a result of Aurora B kinase activity (157). Additionally, 
the Lindner laboratory used the commercially available 
anti phospho-Thrl46 HI (H1.4T146p) antibody to 
identify this modification on condensed mitotic chromatin 
by immunofluorescence (111). Furthermore, Chu et al. 
generated a HI. 4 phospho Ser35 (H1.4S35) antibody to 
show protein kinase A-induced phosphorylation at this 
site causes removal of HI. 4 from the chromatin (158). 
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********************** *. 
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SLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK 117 
SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK 120 
*********** ******************************************** t * 

PKVKKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKA 177 
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PKAKKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKK-AKSPKKA 176 

PKAKKAGAAKAKKPAGAT — PKKAKKAAGAKKAVKKTPKKAKKP-AAAGVKKVAKSPKKA 177 
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KVA-KPKKAAKS — AAKAVK PKAAKP KWKPKKAAPKKK 213 

KTP-QPKKAAKSPAKAKAPK PKAAKPKSGKPKVTKAKKAAPKKK 221 

KAA-KPKKAPKSPAKAKAVK PKAAKPKTAKPKAAKPKKAAAKKK 219 

KAAAKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK 226 
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Pairwise scores are calculated by the number of identities between the two sequences, 
divided by the length of the alignment represented as a percentage. 

Figure 2. Sequence alignment of histone HI variants. (A) Amino acid sequence alignment for the histone HI variants HI. 2, HI. 3, HI. 4 and HI. 5. 
(B) Pairwise scores of sequence homology. The alignment shows a high homology between the human HI variants. 



The amino and carboxy terminal tails of histone HI 
variants are among the most abundantly posttranslation- 
ally modified sequences in the cell. For example, Figure 3 
depicts the known MS-identified PTMs for the histone 
HI. 4 variant. In line with the data in Figure 3, current 
literature shows multiple numbers of simultaneous 
PTMs on histone HI are regularly identified (125,175). 
These results suggest antibodies generated toward the 
tail domains of HI could result in low specificity 
based on the PTM combinations present on the HI 
molecule and the immunogen used for antibody gener- 
ation. This effect is similar to that seen with histone 
H4 and H3 modification-specific antibodies, where 
antibodies must be generated with distinct combinations 
of localized PTMs to retain specificity for the epitopes 
of interest. Consequently, MS has become widely used 
to analyze histone HI variants through the ability 
to bypass the limitations of immunological reagents. 



However, even MS has limitations when analyzing 
histone HI. 

Limitations in the use of MS for the analysis of histone 
HI result from the inability to use common shotgun prote- 
omic methods for analysis. For example, the most 
commonly used endoproteinase for shotgun proteomic 
studies is trypsin. In most commonly expressed soluble 
proteins, trypsin regularly yields peptides of 6-10 amino 
acids in length due to the relative abundance of lysine and 
arginine. Additionally, because trypsin cleaves at the 
C-terminal side of the basic amino acids, each peptide 
carries at least two sites for protonation, one at the 
N-terminal and one at the C-terminal lysine or arginine 
side chain. Thus, when such a peptide is fragmented via 
tandem MS, two singly charged ions are typically 
produced. These properties make trypsin ideal for 
yielding peptides of a mass and charge suitable for 
liquid chromatography-electrospray ionization-tandem 
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Figure 3. An illustration of the MS-identified posttranslational modifications on histone HI. 4. Asterisk denotes 7V-a-acetylation of the N-terminal 
residue after methionine removal. 
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Figure 4. A graphical representation of the tryptic peptide length (A) and a histogram of relative hydrophobicities (B) for histone H 1 .4, histone H4 
and bovine serum albumin. 



MS. Although ideal for most soluble proteins, trypsin 
does not work well for histone H 1 . Figure 4A is a graph- 
ical representation of the peptide lengths generated by an 
in silico digestion of histone H 1 .4, histone H4 and bovine 
serum albumin (BSA) with trypsin. The tryptic peptides of 
HI. 4 are very short, with most peptides 5 amino acids or 
less in length. A similar observation is seen with in silico 



tryptic digest of histone H4. However, BSA yields peptides 
of variable length more conducive to MS analysis 
and increased protein sequence coverage (Table 2). 
Additionally, the peptides that are generated by trypsin 
for histone HI. 4 have low relative hydrophobicities 
(Figure 4B, Table 2). This results in low retention of 
peptides on reversed-phase C18 HPLC columns. 
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Furthermore, of those peptides with hydrophobic 
properties, appropriate masses and peptide lengths gener- 
ally correspond to the globular domain of the protein. 
Collectively, these factors lead to poor sequence 
coverage when compared with more standard proteins 
such as BSA. As a result, the use of common bottom-up 
MS strategies with trypsin is limited. 

The use of alternative endoproteinases for middle-down 
proteomics also yields poor digestion results. For 
example, endoproteinase Glu-C cleaves proteins at the 
C-terminal side of glutamic acid in ammonium bicarbon- 
ate buffer. In silico digests of histone HI. 4, histone H4 and 
BSA with Glu-C, shown in Table 3, yield less- 
than-optimal peptide lengths and charges. Glu-C digests 
of histone HI. 4 result in long, highly charged peptides 
not readily suited for liquid chromatography-MS/MS 
analysis. Additionally, Glu-C cleavage of histone HI. 4 
gives a peptide encompassing nearly the entire CTD (aa 
116-219). Similar results are obtained from in silico digests 
of histone H4. Conversely, Glu-C digests of BSA yield 
many suitable peptides in length and charge for MS 
analysis. Collectively, these results suggest the amino 
acid sequence of histone HI does not lend to commonly 
used bottom-up and middle-down MS strategies. 

The inability to use bottom-up and middle-down 
approaches has drastically limited the ability to study 
histone HI via MS. However, top-down MS techniques, 
although limited, have been successfully applied to study 
histone HI PTMs. For example in Drosophila 
melanogaster, Bonet-Costa et al. used top-down MS/MS 
to map both single and multiple co-existing histone HI 
PTMs after collision induced dissociation or electron- 
capture dissociation (176). Although effectively applied 
to the single HI variant in Drosophila, top-down 
MS/MS on human HI is severely limited by the necessity 
for high protein purity, high concentration and separation 
of the multiple variants. As a result, others have used top- 
down MS to assess the relative abundance of histone HI 
PTMs without fragmentation (89,177,178). For instance, 
Wang et al. monitored changes in histone HI. 5 phosphor- 
ylation patterns after drug treatment in acute myeloid 
leukemia cell lines using intact mass MS (178). While 
giving the number of modifications and abundances, 
these approaches do not yield the specific location of the 
PTM as top-down MS/MS can. Despite these limitations 
in proteomic methods for the analysis of histone HI, 
adaptations of these methods in conjunction with state- 
of-the-art equipment has led to progress in the study of 
histone HI. 



FUTURE DIRECTIONS OF FIELD 



The immunological limitations for studying the function 
of histone HI and its PTMs make it a challenging field of 
research. Although progress has been made, overcoming 
these difficulties will require combinatorial mass spectral 
methods. The use of a top-to-bottom proteomics 
approach will facilitate targeted characterization of 
specific histone HI variants and PTMs of interest where 
a single MS method may fail. Site-directed mutagenesis 
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and the application of single and multiple reaction moni- 
toring experiments to histone HI variants will allow for 
further functional descriptions without the necessity for 
immunological reagents. Collectively, the use of such 
methods will unlock the specific cellular functions of 
each histone HI variant and their respective PTMs. 
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